Cochannel speech separation using multi-pitch estimation and model based voiced sequential grouping
نویسندگان
چکیده
In this paper, a new cochannel speech separation algorithm using multi-pitch extraction and speaker model based sequential grouping is proposed. After auditory segmentation based on onset and offset analysis, robust multi-pitch estimation algorithm is performed on each segment and the corresponding voiced portions are segregated. Then speaker pair model based on support vector machine (SVM) is employed to determine the optimal sequential grouping alignments and group the speaker homogeneous segments into pure speaker streams. Systematic evaluation on the speech separation challenge database shows significant improvement over the baseline performance.
منابع مشابه
Monaural Voiced Speech Separation with Multipitch Tracking
Separating voiced speech from its mixtures with interferences in monaural condition is not only an important but also challenging task. As multipitch tracking can enable much better performance of speech separation for CASA systems, we propose a new multipitch determination algorithm, which can be used under various kinds of noise conditions. In the process of multipitch estimation, a new repre...
متن کاملTitle of Document : MAXIMUM LIKELIHOOD PITCH ESTIMATION USING SINUSOIDAL MODELING
Title of Document: MAXIMUM LIKELIHOOD PITCH ESTIMATION USING SINUSOIDAL MODELING Vijay Mahadevan, Master of Science, 2010 Directed By: Dr. Carol Y. Espy-Wilson Department of Electrical and Computer Engineering The aim of the work presented in this thesis is to automatically extract the fundamental frequency of a periodic signal from noisy observations, a task commonly referred to as pitch estim...
متن کاملSingle channel speech separation in modulation frequency domain based on a novel pitch range estimation method
Computational Auditory Scene Analysis (CASA) has been the focus in recent literature for speech separation from monaural mixtures. The performance of current CASA systems on voiced speech separation strictly depends on the robustness of the algorithm used for pitch frequency estimation. We propose a new system that estimates pitch (frequency) range of a target utterance and separates voiced por...
متن کاملSeparation of speech signals using iterative multi-pitch analysis and prediction
A model for multi-pitch analysis is extended into an iterative multi-pitch analysis and prediction (IMPAP) scheme. The method is efficient in finding harmonic complex tones, such as voiced speech signals, in a mixture of such signals and possible noise background. It can also be used to separate the signal into perceptually relevant speech components. The method may be used in applications rang...
متن کاملSource-Filter-Based Single-Channel Speech Separation Using Pitch Information
In this paper, we investigate the source–filter-based approach for single-channel speech separation. We incorporate source-driven aspects by multi-pitch estimation in the model-driven method. For multi-pitch estimation, the factorial HMM is utilized. For modeling the vocal tract filters either vector quantization (VQ) or non-negative matrix factorization are considered. For both methods, the fi...
متن کامل